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ABSTRACT 



A branch and bound technique is used to derive an algorithm for 
computing the alpha-width of aqy matrix of zeros and ones. Through 
computation of the 1-width of over 200 matrices of various dimensions, 
it is found that less than 20 minutes of computation time on the 
Control Data 160U digital computer is required to complete the compu- 
tation for most matrices. Applications of the algorithm to integer 
programming and to various targeting problems are described. Exten- 
sions are suggested for computing the minimal cost alpha-width, and 
for computing a minimal C-cover. 
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1 . A Targeting Problem. 



Consider the following rather specialized targeting problem? a 
communications network is given (Fig. 1), in which stations can 
communicate directly only with those stations to which they are con- 
nected by a link. Of course, this would be the case with any kind 
of land line network, but it is possible also, in the case of UHF 
radio communications, micro-wave relay systems, and even signal light. 
We ask this question? what is the minimum number of stations that 
must be destroyed so that the network is totally disrupted; that is, 
so that no pair of surviving stations can communicate? 

The answer is given in Figure 2; in which those stations targeted 
have been crossed out. It is perhaps surprising to note that the 
station most central to the network; the one directly connected to 
the greatest number of stations, is not targeted. In fact, if this 
station were included in the target list, we should be forced to 
target the four indicated targets anyway, and thus we would have been 
forced away from the optimal solution. 

Let us note, parenthet ical ly , that no claim is made that our 
targeting policy is the best one. It is quite probably valid, and 
indeed optimal, if the purpose of the attack is, for example, the 
total (and temporary) disruption of an enemy's warning system for 
the protection of a second strike to follow immediately. But assume 
that the network is a railroad system. It is quite possible that a 
policy of bombing junctions, switching yards, and accessible rail 
lines would have little lasting effect on the effectiveness of the 
transportation system because of the ready availability of repair 
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equipment and personnel. For example, in the interdiction of the 
French railroads prior to the invasion of Normandy in World War II; 
our bombing of marshalling yards and other junctions caused little 
disruption of rail traffic, although it did strain the repair capa- 
bilities of the rail system severely. On the other hand, when 
bridges over the Seine, Oise, and Meuse Rivers were added to the 
target list, results were spectacular. On 26 May, all routes over 
the Seine north of Paris were closed to rail traffic and remained 
closed for the next thirty days. By contrast, marshalling yards 
could be repaired in one or two days. (See pp 217-230; and 
especially p 228 of [6]). 

No matter, this simple problem will serve to illustrate the 
very general algorithm to be described in section 3; without re- 
quiring that cumbersome set-up procedures be learned before getting 
down to! work. 

The solution to this targeting problem was obtained without 
difficulty after the initial error of trying to include the central 
station (number 3) in the target list, merely by inspection of the 
network layout. It is unfortunate that so few communications net- 
works of nine stations and eight connecting links are of interest 
in a problem of this type. Clearly, if a network of interesting 
size were examined (let us say on the order of V~> stations and 35 
connecting links), the solution by inspection would be quite diffi- 
cult. Where, then, are we to look for a method of attack on this 
problem? 

It is well known from the theory of graphs, that every graph 
may be represented by an incidence matrix of zeros and ones; in fact 
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by any of several incidence matrices depending upon the purpose for 
the representation. [1]. For the purpose of this paper we will use 
the following terminology from graph theory? a node of a graph is 
the junction of two or more links of the graph (synonym: vertex ) ; 

an arc is a link between two nodes, and in this paper will be con- 
sidered to be without direction. We define the node-arc incidence 
matrix, A, of a graph, by construction as follows: List the nodes 

of the graph horizontally and the arcs vertically so that they are 

labels of columns and rows of the matrix, respectively. If the j un 

tl"l 

node is a terminal point of the i L arc, set a^. « 1. Otherwise, 
set a. . =0. The node-arc incidence matrix of the communications 

ij 

network of Figure 1 is displayed in Figure 3 . 

The targeting problem restated in graph theoretic terms is: 

Find the minimum number of nodes so that each arc of the graph has 
at least one of the nodes as a terminal. Since we already know the 
answer to this simple problem, it would be well to examine this 
solution applied to the node-arc incidence matrix. We construct a 
new matrix from the incidence matrix by including only those columns 
labelled with one of the nodes in the solution set. This matrix is 
displayed in Figure li. It contains the same number of rows as the 
original matrix, but has only four columns. We note that whereas 
there were two " 1 " *s in each of the rows of the incidence matrix 
(one for each terminal of each arc ) 5 there is only one u l u in the 
sub-matrix. 

A little reflection upon the above observation leads to a third 
formulation of the targeting problems given the node-arc incidence 
matrix of the communications network, find the smallest subset of 
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columns of the matrix with the property that each row is represented 
by at least one "1" in this subset of columns. But this smallest 
subset of columns is precisely what Fulkerson and Ryser call a 
minimal set of representatives for the (0,1) matrix, A| and the 
cardinality of this set is called the "width” of A. [ U ] . The problem 
may be generalized? we require that each row of the matrix be 
represented by at least alpha "l tt, s (where alpha is a positive integer). 
We shall use the terminology "minimal a-set of representatives for 
the (0,l) matrix, A"; and "a-width of A". This terminology is due 
also to Fulkerson and Ryser. [ h 1 . 

Thus the simple targeting problem may be solved by finding the 
1-width of the node-arc incidence matrix of the communications network. 
It is the purpose of this paper to present an algorithm for finding 
the a-width of any (0,1) matrix; and for specifying at least one 
minimal a-set of representatives for that matrix. Since we already 
have solved one problem of this type, we shall use this communications 
network and its associated incidence matrix for illustrative purposes 
throughout the balance of this paper. 

We now state the general problem which we desire to solves given 
a finite set, X, and a class, Y, of k non-empty subsets of X (but not 
necessarily the class of all non-empty subsets of X) , find a sub- 
class, Z, of Y, with the property that if x e X, then x is a member 
of at least a of the members of Z. This is a quite general problem, 
as will be shown in later sections of this paper. Any problem which 
can be formulated in the terms specified in this paragraph is capable 
of being solved by the algorithm to be presented. The incidence 
matrix for this abstract problem is constructed by listing members of 



6 



X vertically and subsets of X horizontally. Then we place a ,f l" 
in the i^^ 1 row and j^ 1 column if the i^ member of X is a member 
of the subset of X. 



2. The Class, lf(R,S) . 

Let A denote the (0,1) matrix of size m by n; that is, A is a 

matrix with m rows and n columns, each of whose elements is either 

zero or one. Let the sum of all of the elements of the i 0 ' row be 

denoted try rj; and the sum of all of the elements in the j^" 1 column 

be denoted by s,. That iss 
J J 



1 n 
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^ i=l J 
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• • • , m ) 



2 , . . . , n) 



T 

We call the column vector, (r^, ..., r ) *= R, the row sum vector; 

and the row vector, (s., s 9 , ..., s ) « S, the column sum vector. We 
denote try /l|(R,S) the class of all (0,1) matrices of size m by n with 
row sum vector, and column sum vector, R and S, respectively. 

From the class, ^(RjS), many very interesting combinatorial 
results may be obtained. An excellent survey of this material may be 
found in Ryser.[10J. We will be concerned primarily with a parameter, 
£ , or £(o), of the class, which is defined as the greatest lower 
bound on the a-width of any matrix in ^j(R,S) . That is, *£ is the 
a-width of the matrix in ^l((R,S) which has the smallest a -width of any 
matrix in the class. 
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Although not of concern until a later section, it will be of 
interest to determine under what conditions the class, *5L((R,S) is non- 
empty. Let 6. « (1, 1, ..., 1, 1, 0, 0, ..., 0) be an n-dimensional 
vector with the first r. components equal to one, and the remaining 
n - rj components equal to zero. We then define a matrix of the form, 



A 



m 



called the maximal matrix with row sum vector, R. It has column sum 

ra , — . n 

R 

. . l ' 

r 

fixed, S is unique, by definition of the <Sj, and the class %[(R } S ) , 



r - i TO r-i ^ 

vector, S ** (si , s 0 , ..., s ). Now since ) r* \ s . for 

12 n /-i-i 1 L* w J’ 



try a simple contradiction argument, has only one member; namely, A. 

Let Q * (q x , q 2 , ..., q R ) and Q* « (q*, q*, ..., q*) be any two 
k-dimensional vectors whose components are non-negative integers. We 
say that Q is majorized by Q*, denoted Q K Q», provided that with sub* 
scripts renumbered so that q^ > q 2 rl ^ q R ; and r! q| ^ ... rl q^, 
the following statements are true: 

(3) q +q +...+q ^q»- + q^+...+q^ ( j«l , 2 , . . . , k-1) 

1 d J 1 2 j 



(h) 



q i + q 2 



q^ + q| + 






We say that Q is normalized if q^ > ^ — *** = q R * q ^ ese ^wo definitions 
now enable us to give conditions under which ^i(R,S) is non-empty. 

Theorem 2.1 

Let R « (r 1s r_, ..., r ), and S «* (s,, s~, ..., s ) be two 
v 1* 2 m ’ 1 J 2 n 

normalized vectors whose components are non-negative integers, and 
^ — 1 m ^ — , n — 

such that ) r i "/ s > * Let ^ the maxi™ 3 !- matrix of size m 
i“l J 



by n, with row sum vector, R, and column sum vector S « (s^, s^, . ..,s^) 
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Then a necessary and sufficient condition that ^J(R,S) be non-empty 
is that S S. 

Proofs Assume S/4 S. Since \ S: s ) s. * \ r_, it 

^ — ' j — 1 ‘ j«=i J i = l i 

must be that S/4 S because equation (3), above, is violated, that is, 
for some k, it must be the case that s. + s 9 + . . . + s > s + s 
+ ... + s^. But then A is not maximal, since the first k columns of 
A contain more "l M, s than the first k columns of A« The hypothesis 
is that A is maxima 1, so we have arrived at a contradiction, thus 
demonstrating the necessity of the theorem. 

To show sufficiency? we shall construct a matrix, A e ^<((R,S) 
from the maxtrix A. This construction is due to Ryser. [9] . The 
construction will proceed by shifting ones in the i th row of A to 
other positions in the same row. We note again, that R, S, and S are 
all normalized, and that S 4 S. If s^ < s^, rearrange the ones in the 
rows of A so that only s^ ones remain in the first column. We may do 
this unless Sj > s^ ( j =2 , ..., n) , in which case, s^ + s^ + . • • + 
s„ > n«s, ^ s, + s n + . . . + s = s. + ... + s ; an absurdity. We 
continue by induction. Suppose that the first t columns of A have 
been rearranged. The matrix thus far constructed has the form, 



A 1 *[b x b 2 . . • b t b t+1 . . . b n j 
where there are Sj ones in the j column of A 1 (j « 1, ..., t). 

We now construct the (t+l) st column. Let the number of ones in 
the column be s* (j * t+1, . .., n). We may construct A 1 without 

loss of generality such that, s£ + ^ S ... ^ s^. Now it is possible 
that either s t+ ^ < s£ + ^, or that > s£ + p We consider each case 

in turns 
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Case Is s. 



t+1 ' s t+l 

Remove ones from the (t+1) column, placing them in other 
columns to the right. If sufficiently many ones may be removed by this 
procedure, the column of A is constructed, and we are finished. Sup- 
pose therefore, that there remain, d ones in column t+1, so that 
s t+i < d Let the matrix at this stage be denoted by ^e rs ^. 

Now if d > then for every e r ■ lj we must have e r . « 1 

(j - t+2,..., n) . Hence s t<f ^ + ... + s must at least equal d*(n-t). 

But s t+ ^ < d; s t+ 2 ^ s t+1 < d; etc., so that 

s t+l + ••• + s n < (n-t)’d S s t+1 + ... + s n 



an absurdity. 

Case II: s t+ ^ > s^^ 

^ f" 

Insert ones in the (t+1) column from columns to the right. If 
sufficiently many ones can be inserted, we are finished. We therefore 
assume that sufficiently many ones cannot be inserted by this procedure; 
in fact, we assume that column t+1 contains only d ones such that 
s t+l S d < s t+1 . Again, let the matrix at this stage of construction 
be denoted by [e rs ]. Then if e r * 0, it must be the case that 

e rj 

Then either e ( 

Consider the case in which e ^ ■ 0. Since s^ ^ s t+ ^ > d, there must 
exist epk * 1, and e^ - 0. We interchange e^j and e and also 
interchange ep^ and ep t+ ^. This increases the value of d by one 
without changing the value of any column sum for columns to the left 
of column t+1. Suppose we make all such interchanges and still, 
d < s t+ ^. This situation includes the case mentioned above, that 



0 (j » t+1, ..., n) . Now suppose that e^ « 1 for some j = t+2. 
jqk * 1 for all k S t+1, or else, for some k t, e ^ - 0. 



'qk 



** 1 for all k t+1. It is no longer possible to shift ones from 



columns t+2, ..., n; into columns 1, ..., t+1. This must mean that 
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either all of the ones for a given row are in columns to the left of 
column t+2$ or that all of elements of a given row to the left of 
column t+2 are equal to one. In either case* it must be that, 
s ^ + ... + s^ + d = s^ + ... + s^ + 

But then, since S -<( S, 

s ^ + ... + — s i + • • • + s t+l ( ~ s i + ••• + + d) 

whence s t+ ^ ^ dj contrary to the assumption. QED 

We now consider an extension of the concept of a-width. Let 
C *= (c^, ..., c^) be an m-dimensional vector of non-negative integers. 
We wish to find the smallest subset of columns of A e >t((R,S) such 
that the i^ row of A is represented by at least c^ ones in this sub- 
set of columns. Such a subset of columns will be called a minimal C- 
cover for A, and we shall denote its cardinality by e(C), called the 
C-width of A. Clearly, if C « (a, ..., a), then e(c) - e ( a ). We 
define e(C) to be the greatest lower bound on the C-width of any 
matrix in %R,S). e(C) can be estimated by p(C) as follows: 



We shall use this formula in the algorithm to be presented in section 
three. Note that if C = (a, . . . , a) j then p(C) *= the smallest integer 



(0,1) matrix. The branch and bound technique was suggested to me by 
D. R. Fulkerson of the RAND Corporation, Santa Monica, California, and 
is patterned after the branch and bound solution to the travelling 
salesman problem designed by Little, et alii. [ 7 ] 



(5>) p(C) ** the smallest integer such that 





3. Derivation of the Algorithm. 



Vie shall now describe our algorithm for finding the a-width of a 
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We have several techniques for estimating e( a) (which we shall 
henceforth call the ct-width of the class, ^U(R,S)). One such tech- 
nique is described in the preceding section, in which we compute the 
parameter, P. Now a given (0,1) matrix of size m by n, is a member of 
a class, %(R,S). We can partition the class, %[(R,S) into two sub- 
classes, one consisting of those matrices which have a selected column, 
say column p, as a member of a minimal a-set; and the other consisting 
of those matrices for which column pis not a member of any minimal 
a-set for the matrix. 

We thus have two sub-classes, each of which has no more members 
than the original class, and we know that the original matrix must be 
in one, and only one of the sub-classes. Consider the sub-class whose 
matrices have column p as a member of a minimal a-set. We may use 
this information to reduce the dimensions of all the matrices in the 
class as follows: if a * 1, then every row which has a one in column 

pis adequately represented try column p, and needs not be considered 
subsequently. If a ^ 1, we still may note that these same rows are 
represented once by column p, and thus need be represented only a- 1 
more times subsequently. Furthermore, we have made a "decision” 
about column p, namely that it is included in a minimal a-set of all 
matrices in this sub-class. We may thus reduce the dimensions of all 
matrices of the sub-class by one column, and (if a* 1) by a number of 
rows. If a «/ 1, we will keep track of those rows which yet need 
only a =. l representatives. Hence we have for this class a vector, 

C, whose components are either a, or a - 1. 

We may also reduce the dimensions of the matrices in the other 
sub-class by one column, for we have made a "decision" for this sub-class 
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namely, that no ot-set contains column p, for ary matrix of the sub-class 
Hence every row of the matrices of this sub-class needs to be subsequent 
ly represented a times, regardless of the value of a. 

Now let us estimate £ for each of the two sub-classes. It is clear 
that these two numbers are both estimates of the a-width of the original 
matrix. We may actually improve the estimate for the first sub-class 
discussed, by adding one to the estimate of e for the sub-class. This 
is to account for the inclusion of column p in any a-set for any matrix 
in this sub-class. Now, it is certain that the smaller of these two 
numbers is not greater than the a-width of the original matrix. 

Let us examine the sub-class corresponding to the smaller of the 
two estimates. We may partition this sub-class into two sub-classes, 
and so forth, until finally, some sub-class will be so small as to 
contain a unique matrix whose C-width we can determine by inspection. 
Part of such a continuing procedure is represented by the tree struc- 
ture of Figure 5>. 

Now at ary point in the procedure, the set of junctions (Fig. 5>) 
which have no lines leading toward another junction represent a 
partition of the class to which the original matrix belongs into two 
or more sub-classes. By an obvious extension of the above discussion, 
the smallest of the several estimates for the “-width of the original 
matrix is not larger than the a-width of the original matrix. We 
may then focus our attention on the sub-class corresponding to this 
smallest estimate, branching out from the corresponding junction until 
one of the earlier estimates of e is smaller than ary of the most 
recently constructed estimates. Now let us set up a formal algorithm 
based upon the preceding discussion. 
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Let the (0,1) matrix, A e ^(R,S) , be given, with dimensions 
m by n. A matrix is said to be normalized when both its row sum and 
column sum vectors are normalized, and when the elements of the matrix 
have been rearranged so as to fit the new row and column sum vectors. 
Clearly, we lose no generality by considering only normalized matrices. 
Therefore, throughout the remainder of this paper, we assume that all 
matrices and sub-matrices have been normalized as part of the operation 
of constructing them. 

Notation will, of necessity, become rather cumbersome, and for 
that reason, we now present such notation as we shall need in this 
section. There will be certain preliminary steps which serve to de- 
crease the amount of work required in the main part of the algorithm, 
and since these preliminary steps are not always applicable, we shall 
assume that the given matrix, A e ^(R,S) , is the one with which we 
shall enter the main part of the algorithm. 

The procedure in the algorithm is basically broken into two 
parts; (1) selecting a column for inspection and deriving the two 
sub-classes corresponding to the inclusion in, and the exclusion from 
the a-set of the selected column (the "branch" portion); and (2) 
estimating £ from each sub-class and choosing among all estimates, the 
smallest for the next iteration (the "bound" portion). We shall 
carry out the "bound" portion of the procedure by calculating p for 
each of the sub-classes and adding to p, the number of columns pre- 
viously included in the a-set on the current branch. Eventually we 
shall obtain a sub-class of matrices, one of whose dimensions is zero, 
and is thus, empty. Clearly, e for this sub-class is zero. We can 
make a test for completion at this point. If the test fails, we 



IS 



continue the algorithm along some other branch. It can be seen that 
we shall derive an even number of different sub-matrices of A before 
we reach termination. We shall subscript these sub-matrices in the 
order in which they are derived. Associated with each of the sub- 
matrices, of course, will be a row sum vector, a column sum vector, an 
estimate of the C-width of the class to which the sub-matrix belongs, 
and an estimate of the a-width of A based upon the condition that it 
can be obtained by continuing along the branch from which we derived 
the present sub-matrix. Note that since we may discontinue considera- 
tion of one branch at any time, and return to a previously discontinued 
branch j the subscripts of the matrices which we shall derive at ary 
point of the procedure bear no relation to the subscript of the matrix 
from which the derivation follows. This point will be made again 
during our step by step description of the algorithm. 

Now, we subscript every parameter associated with a particular 
sub-matrix with the same sub-script as its associated sub-matrix. We 
shall also require a "label" for each sub-matrix, and the typical 
label will be of the form "a,b,c,d, . . .". This label gives us the 
information that for each particular sub-matrix, every column of A 
which is present in the label has been branched uponj and those columns 
which appear unbarred are assumed to be included in the a-set, whereas 
those which appear with a bar over them are assumed to be excluded 
from the a-set. Thus, the a,b,c,d in the example label above represent 
positive integers which are the column numbers of the original A matrix. 
One further convention to which we shall adherej an even sub-script 
is taken to mean that the latest column upon which we branched is 
considered to be included in the a-set associated with the sub-matrix, 
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and thus this column number will appear unbarred in the associated 
label. On the other hand, an odd subscript is taken to mean that the 
latest column branched upon is considered to be excluded from the a-set 
associated with the sub-matrix, and thus this column number will appear 
in the associated label with a bar over it. The described notation is 
summarized be lows 



Ap £ ^(R , S ) has dimensions m try n . 

p p _ P P 

T 

R p - < r pl' r p2’ •••’ • 

S p " (s pl’ V’ S pnp )- 



E k ^ — i m 

s . % ) c , 



e ' ** D + the number of columns unbarred in the label of A , 
P P P 



It is obvious that much information must be recorded for each of 

several matrices. Although a structure similar to that of Figure £ 

could be used, we suggest the format of Figure 6. This figure shows 

a typical matrix Ap and all of the required information associated 

with this matrix. It will be convenient to suppress zero elements of 

the matrix. Note that we list the subscripts of the columns of the 

original matrix along the top, and directly below that, the order of 

subscripts for the derived matrix, A^. The order of subscripts for 

the rows of Ap is listed along the left side of the matrix, and Rp and 

Sp are listed along the right side and the bottom, respectively. At 

some convenient point we list p , e' , and the label associated with 

P P 

the matrix. 

In section four, we solve the targeting problem of section one 
using this algorithm. The reader may desire to read section four 
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concurrently with the description of the algorithm which follows. 



3.1 Preliminary Steps . 

PI. If r < a; the a-width does not exist. We terminate, or else 
m 9 

decide to look for an a-width in which a is a smaller integer than 
that which the original problem specified. 

P2. If r m > a; go directly to step SI, in the main part of the 
algorithm. 
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Figure 6 



P3 



If r ** r , 
m m-1 



* r = a, for some k, (0 ^ k^m-l): 
m-k 



then it is evident that each u l ,r in any of these k+1 rows must belong 

to a column in the minimal a-set of representatives for A. Therefore, 

in each such row, say the i^ 1 , for each j such that a.. « 1, record 

^ J 

that the column is in the minimal a-set and delete the column 
from the matrix. Let C * (a, ..., a) be an m-dimensional vector. For 



each k such that a^j * 1 subtract one from the k^* 1 component of C. 



18 



When this has been done for all columns, j, deleted from the matrix, 
delete any row, i, for which c. II 0. Finally, recompute new row sum 
and column sum vectors, normalize the new matrix, and proceed to 
step SI in the main portion of the algorithm. Now the set of columns 
that has been deleted in this preliminary step will not again be ex- 
plicitly mentioned. The reader is cautioned to remember to add 
these columns to the a~set computed in the next section in order to 
arrive at the true a-width of the matrix, A. 

3.2 The Branch and Bound Algorithm. 

51. We are given A e ^J[(R,S) which has been normalized. If C was 
not computed in step P3, let C » (a, ..., a). Cross out column one of A. 
We shall branch on this column because it is the column with the largest 
column sum. This is an entirely arbitrary decision. We could branch 

on any column whatsoever, but it seems reasonable that the one with the 
largest column sum would be likely to be included in the a-set. A 
counterexample is easy to construct. In any case, it is now necessary 
to decide whether or not to include this column in the a-set. 

52. Let us denote the matrix, A by [6 6 ... 6 ]. Construct 

id. n 

A.^* 5 ^2 "S * ** anci ^ "l" ° We examine the consequences of 

excluding column one from the a-set. A^ is of size m^ by n^ (=n-l), 
and A^ e Our decision means that we still must locate c. 

representatives for each row, but that we may not use arry of the ** 1 11 * s 
in the excluded column of A. Calculate by equation (2-5), and 
since no columns are unbarred in the label, let * p^ . 

S3« We next examine the consequences of including column one 
in the a- S et. Construct A2 by deleting column one from A, forming a 
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temporary vector, and deleting every row for which the component- 
wise difference of R and R^ is greater than or equal to the corres- 
ponding component of C. That is, if r. - r^. ~ c. j delete row i. 
Clearly this can happen at this step only if c. ■ 1. We have now 
reduced A by one column and perhaps some number of rows. This re- 
duced matrix, when normalized is called A 2 , and we now form the 
permanent vectors, R 2 and S 2 . We label this sub-matrix, "1 st . is 
calculated by equation (2-5) > and since column one is unbarred in the 

associated label, e* * + 1. 

2 2 

Slu We must now decide along which branch it will be most 
profitable to continue. We make the decision try choosing the sub- 
matrix associated with min [e^, ei,]. If e’ ** the choice is 

arbitrary. When using the algorithm for hand computation, the best 
choice is probably that matrix with the greatest number of unbarred 
columns in the associated label, that is, in this case, A^. Having 
made this decision, we set min [e!^, ejj « «, so that the same 
branch will not be chosen again at a later stage. We proceed to step 
S£. All succeeding steps in the algorithm will be described in 
general terms. 

S£. In the preceeding step, we decided to proceed using matrix 
A^, say, with associated label, n p,q,r,s,t,u”| a particular one of the 
k matrices thus far constructed (k L) . Since A^ has been normalized, 
the first column has the largest column sum. We therefore select 
this column as the next branch point. Let us say that this column 
corresponds to the v^ column of A. 

S6. Denoting A^ by [6^, 6^, ..., 6^^], we construct the next 
sub-matrix, •••> ^Lm^* thus ;f ' ind ^ n 9 sub-matrix 
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corresponding to a decision to exclude column one of A^ (column v 
of A) from the ct»set. A^ + j_ is of size m^ try n^,* and A^+j 

e ^ R k+l > S k+1^° 

57. Noting that columns p, q* and s of A have been included up 
to this stage* we form a vector of row sums of included columns* which 
we shall call RS. That is, referring back to. the A matrix, we compute 
for each row, the number of ones in columns p, q, and s. Obviously, 
for this particular label, the sum cannot exceed three. Now we 
compute a test vector, RT. Let the i th component of RT be the maximum 
of zero and c. - rs^ (the i th component of RS). The vector, RT, gives 
us the number of ones yet to be included in each row of the matrix, 

by some subsequent choice of columns. Since both RS and RT are 
vectors which are required only at this branch* and will thence be 
discarded, there is no need to subscript them. 

58. Returning to our decision to exclude column one of A^, we 

examine each component of R^^. If any component of R^ + ^ is less 

than its corresponding component of RT, it is infeasible to exclude 

this column. We set e * , s °°* but retain the label of A, . which is, 

k+1 k+l 9 

in this case, ,, p,q*r,s*t*u,v ,t . We then proceed to the next step. 

If, on the other hand, we determine that the label represents a 
feasible set of columns, that is, no component of R^ + ^ is less than 
its corresponding component of RT , we compute p , by equation (2-5) 
using the vector RT in place of C. Since there are three unbarred 
columns in this typical label* we set ~ p^ +1 + 3. 

S9» We now construct matrix A^ + 2 with label "p,q,r,s, t ,u,v" . 

This matrix corresponds to the decision to include column v of A in 
the a-set. We delete column one of A^. For every row which had a 
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"I" column one of A^, we subtract one from the appropriate component 
of RT . If this component now is zero, we delete the corresponding 
row of A^. When the procedure is completed, we have the matrix Aj^* 

We compute R^g and ^k+2’ and equals p ^2 * h 

in this case. If, however, either dimension of the matrix becomes 
zero at this step, we proceed to step Sll, as it is possible that 
this represents termination. p ^ + 2 > course, is computed using the 
vector RT instead of C. 

510. Let P be the k+2 dimensional vector whose components are 
the el. We find the minimum component of P, choosing arbitrarily 
in case of a tie, and use this component's corresponding matrix for 
our next path. Although an arbitrary choice in case of a tie will lead 
to solution, there are two techniques for choosing between branches 
that will probably shorten the algorithm somewhat. These are, either 

to stay with the current branch in case of a tie in which the current 
branch is involved! or to take the branch which has the largest number 
of unbarred columns in its label. The second method is probably the 
best, but in the computer algorithm we shall use neither technique! 
branching instead on the matrix with the smallest sub-script because 
of programming simplicity. Let us say we have chosen matrix A^ for 
our branching matrix. We set « °°, and return to step S£, con- 
tinuing the algorithm. 

511. Since A ^2 °f zero dimension! P ^ + 2 m Then is 

equal to the number of columns that are unbarred in the label of 
A-k+2* Now if e ^2 > e [ ^ or an ^ * < ^+2, we have not necessarily 
found a solution, so we return to step S10, after duly recording the 
proper values for all of the parameters associated with this sub^matrix. 
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Of course, it is now meaningless to consider row sum and column sum 
vectors. This is a minor point, since the only purpose of these vec- 
tors is in computing p, and in deriving subsequent matrices. It is 
clear, though, that if we at a later time choose to branch upon this 
matrix of zero dimension, it is because we have found that it is after 
all, an optimal solution to our problem. 

If, on the other hand, e ^ + 2 ^ el for all i = k+2, the unbarred 
columns in the label of A ^2 constitute a minimal a-set of representa- 
tives for A, and the cardinality of this set of columns is the a- 
width of A. Thus we have arrived at a termination point of the 
algorithm. In the next sub-section, we shall prove that the algorithm 
does find a minimal a -set of representatives, and that it terminates 
in a finite number of steps. 

3.3 Proof that a Solution is Reached . 

We need to show that the algorithm does find a minimal a-set of 
representatives even though many possible combinations of columns 
have not been considered. It is first necessary, though, to show 
conditions under which the a-width exists. We have alreacfy stated, in 
step PI, that if a < r , the a~width does not exist. We now prove a 
necessary and sufficient condition for the existence of the more 
general C-cover of A; 

Theorem 3»1 « 

The matrix, A, has a C-width for every vector, C, whose components, 
c- are bounded above by rj. 

Proofs By hypothesis, c. fE r^j hence A, itself, is a C-cover for 
every admissible vector, C. For a fixed C, the collection of all 
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C~covers is thus non-empty , and clearly is finite. Then the collec- 
tion has a minimal member, and the cardinality of this minimal mem- 
ber is the C~width of A. 

QEC 

Corollary ; 

The matrix. A, has an a-width, e(ot), for each integer a in the 
interval, l — a r . 

We shall now demonstrate that the branch and bound technique 
of section 3.2 will find the minimal C-cover of A in a finite number 
of iterations. We shall further show, that the branch and bound 
technique is independent of the technique for computing a bound on 
e , under some rather simple restrictions. We shall call c(C), the 
C-width of the class, ^(R,S). Wow e (C) is clearly a function of 
the dimensions of the matrices in the class. Let P*(C) be an esti- 
mate of ^(C) such that p-*(C) =^(C), and such that for the class > 

^ .—A. 

one of whose dimensions is zero, p-*(0) ** e(0) e Oj where 0 is the m 
dimensional zero vector. We insist in. what follows that the esti- 
mating technique for computing p#(C) be applied consistently. The 
parameter, P, described in section two satisfies the above require- 
ments on P*(C). 

Let A be a given matrix and estimate ^(C) by p-s-(C). Then the 
C-width of A is not less than p*(C). Now construct matrices A^ and 
A^ as in steps S6 and S9 of section 3.2 using any column of A, say 
column t, instead of that column whose sum is the largest. Estimate 
e(Cj) for each of the sub-matrices thus constructed by p^Cj), and 
p 2^2) respectively. Then the C^-width of A^ is not less than p*(C^), 
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and the C 2 ~width of is not less than p-|(C^). The vectors, C., 
have components equal to the number of !, l w, s yet necessary to rep- 
resent the row of A. For example, if the i^ component of C 
were 2 and the selected column contained a one in its i L place, then 
the i th component of would be 2$ but the i th component of would 
be 1. Now since column t of A must be either included in, or excluded 
from the minimal C-cover of A, the C-width of A is not less than 
min l p ^(Cj_), P|(C 2 )+ll . We need no longer consider p-*(C) as an 

estimate of the C-width of A. Clearly then, if we repeat this esti- 
mating process, using A^ or A 2 a s a new given matrix according to 
whether p|(C^) or p^(C^)+l is the smaller, we may compute two 
additional estimates of the C-width of A. Eventually (after a finite 
number of such estimates have been made), we shall construct a matrix, 
one of whose dimensions is zero. In that case, p-|^(C 2 r) “ 0, and 
the C-width of A cannot be less than the cardinality of the set of 
columns slated for inclusion in the C-cover of A. This set of columns 
is, in fact, a C-cover, and if the cardinality of this set is less 
than or equal to all of the other computed estimates of the C-width 
of A, then it is a minimal C-cover, since we required that ary esti- 
mate be bounded above by e(C) . 

We refer the reader once again to the scheme illustrated in 
Figure 5>« If each branch of this tree were to be taken to its termi- 
nation (at worst, the point at which each column of A had been tested 
either for inclusion or exclusion), each such terminal could be 
represented by an n-tuple as follows? let the i^ component be one 
if the i^* 1 column had been included on this branch, and let it be 
zero otherwise. There are 2 n unique n-tuples, hence at most 2 n 
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corresponding terminals, each attainable in a finite number of steps. 
Hence the algorithm must terminate in a finite number of steps. 

It. Manual Computation with the Algorithm. 

Let us return to the targeting problem described in section one, 
and solve this problem to illustrate the use of the branch and bound 
algorithm. We reproduce the matrix of Figure 3 , as Figure 7 for 
ready reference. Zeros have been suppressed, and we have appended the 
components of R and S to the right and bottom of the matrix, respectively. 
Figure 8 depicts the normalized matrix, A. We have appended the original 
column subscripts above the matrix. 

In this example, ij and it should be noted that in general, 
increasing a, significantly increases the complexity of the manual 
algorithm, because of snort -cuts used in deciding which rows may be 
deleted. These short-cuts are not available for a 1. The RT vector 
need not be constructed, since its components could only be zero or 
one, and such a simple vector can be handled by inspection. However, 
the short-cuts cannot be conveniently programmed, so the computer 
version of the algorithm can handle differing a»s with almost equal 
facility. 
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We show, in Figures 9 and 10, the matrices A^ and A 2 > respectively, 
derived from A as follows? We delete column 1 from A and since this 
column corresponds to column 3 of the original matrix, we label A^, 

"3". Naturally, we have normalized both and S^. Now locate each 
row of A which has a "l" in the first column. Delete this row, delete 
column 1, and we now have A 2 , after normalizing R^ and S^. This 
criterion for deleting rows is a simplification of computing RT , which 
is the short-cut mentioned at the beginning of this section. The 
rows deleted in the example are rows B, C, G, and F. We label A 2 , 



»3". 



For the matrix, A 
for A^, P 2 “ b, since 




e» 

2 



?2 + 1 « !>• 



since 
« m 2 ** 




lu Hence 



« m^ « 8. Similarly, 

cl - p, - h; and 
1 1 
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r 4 

Min ® so we choose to branch on matrix A 
set ej « «. Column 1 of A^ corresponds to column 2 of the 
matrix, and is the column which has the largest column sum 
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We delete column one of A^ and thus have matrix A^, which we 
label ”3,2". This sub-matrix is shown in Figure 11, below. Since 
row B of A^ has row sum zero in Ay and since there are no included 
columns in the label, this represents an infeasible set of column 
exclusions. Therefore, without further consideration, we set 
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Sub-matrix A. 
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Next, we delete each row of A^ which has a "1" in the first 
column, namely, rows A and Bj and we delete the first column of A 



This gives us matrix Ai , depicted on the next page in Figure 12. Of 

~ V -1 ^ 

course, the label for A^ is "3,2". Now ^ ^ s^j = m^ « 6; so p^ « 3, 
and + 1 = ii. Note that the sum of column h of A^ goes to zero 

in A^, so we may delete it. 

Now min [e*, e', e', ef] ~ e' ® ii. Hence we choose to branch next on 
12 3a k 

matrix A, . We set e .* * ®, and choose column one of A. for examination. 
hh li 
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Deleting this column, which corresponds to column h of the 

original matrix, produces A e , with label ”3,2,1;". This sub-matrix is 

5 

reproduced in Figure 13, below. Note that the sum of row C of A^ has 
gone to zero in A^. 
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This means that the label represents an infeasible combination of 



columns, and we therefore set «= 00 without further consideration of 
this sub-matrix. 



Now 



we derive sub-matrix A^ by deleting rows D and C from A^, 



since each of these rows has a " 1" in column one of A, 



We also delete 



column one of Ai and give this sub-matrix the label, "3, 2, id' . The 

<r~ » 2 

matrix is presented in Figure lij. below. Since ) s , . * m. « h, 

^ i»i 61 6 

we have that p, « 2, and e' « lu since there are two unbarred columns 

6 6 

in the label for A^. Clearly * min [e'„ ] (i=l,..., 6); so we choose 
to continue along this branch. Thus A^ will be our next branching 
matrix. 
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We derive sub-matrix A^ from A^ (see Figure 15>) by deleting 
column one of A^, corresponding to column 7 of the incidence matrix. 
This sub-matrix has label, "3, 2,1*, 7". Once again we run into the 
situation of a row sum going to zero, so this sub-matrix, too, repre- 
sents an infeasible combination of columns. Therefore, we set e* « 00 , 

7 

and proceed. 
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By the deletion of rows E and F from A^, we arrive at sub-matrix 
Ag, which also has column one of A^ deleted. This two by two matrix 
is displayed in Figure 16, below. The label is "3 ? 2,h,7 n , and we 
note that s g ^ s mg ® 2$ therefore, Pg ® 1, and since there are three 
unbarred columns in the label, = U. Note also that the column 
sum of column 6 of the original matrix has gone to zero, so we delete 
that column also. We see that = min [e|] (i e l, 8), so we 

branch on matrix Ag. We remember to set ® « f and choose column one 
of Ag for examination. This column corresponds to column 8 of the 
original matrix. 

8 9 

1 2 R 8 

H 1 1 2 

G 1 1 

ie 2 1 

Sub-matrix A fl "3,2, U, 7" P *= 1 e * = h 
0 8 8 

Figure 16 



32 



Deletion of column one of A Q gives us a one by one sub-matrix 

o 

which has label "3,2, li, 7 > 8” • This matrix represents an infeasible 
combination of columns since row G has vanished. Thus we set 
and proceed. 

We see immediately that we have reached termination, since the 
matrix A^ is of zero dimension and has label u 3,2,li,7,8". This 
means that * Li and that « min [e^] (i = l, ..., 10). 

The 1-width of the incidence matrix is li, and a minimal 1-set of 
representatives for the incidence matrix is the set of columns, ( 2 , li , 
7,8). These, of course, would be the station numbers that were to be 
targeted in our original problem. 

Computation of e (°Q . 

We notice that for the very simple problem presented in section 
four, ten matrices had to be written down. The writer has observed 
that in hand computation, one matrix can be used for deriving only 
two or three sub-matrices before the paper becomes impossible to read. 
Even a small matrix requires a considerable amount of time to write 
down, especially when normalization cannot be done in one's head. In 
the computer version, due to limited storage space it is necessary 
to recompute a matrix each time it must be used, so even at high 
digital computer speeds, it would be desirable to reduce as far as 
possible the number of matrices that had to be examined. 

Unfortunately, for the computation by hand, little can be done 
to simplify the problem, but in the case of the computer algorithm, it 
is possible to compute a) exactly at little expense in time. Un- 
fortunately, this computation will be useful only when a« lj and 
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hence, when the components of the RT vector of section three can be 
only zero or one. But this case is the one which is of greatest 
interest, and it is thus very worthwhile to study this computation. 

There are at least two derivations possible. One, which is 
entirely combinatorial in nature, gives considerable insight into 
the class, ^?./(R,S), at the expense of being quite lengthy and not 
very intuitive. The interested reader is referred to Fulkerson and 
Byser. [ li] . 

We shall use a network derivation which is considerably shorter 
and more intuitive. The procedure for a** 1 is outlined in [2]. It 
should be noted that the formula was first derived using network con- 
siderations. We require the following theorem in the network deriva- 
tion to follows 
Theorem 1 . 

Let A e <5 i/(R,S) have a-width, e(a). Then there is at least one 
matrix, A , in £/(R,S) such that the first e columns of A constitute 
a minimal a-set of representatives for A £ . 

Proofs Consider ary matrix, A e ^/(R,S) with a-width, e(a). 

Let E* be that subset of the columns of A consisting only of the members 
of the minimal a-set of representatives. If E* is the first e(a) 
columns of A, then A ** A £ . Therefore we assume that column p is the 
leftmost column of A not in E*. Now locate column k such that column 
k is the rightmost column of A in E*. 

Let Rg ^ ( r El* ° • • j r £ m ) the vector of row sums of E*. Now 

if r_. « a and there is no a. « 0 for which a ** 1, (i»l, .... m) , 

Ei ip ik 

we may replace column k by column p in E-*, and the new columns of E* 

are a minimal a-set of representatives. Suppose therefor^, that we 

have for some i, r.,. » a , a. * 0 and a.. « 1. We call a., a critical 
? Ei ip ik ik 
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1 for which 



one of E*. Then since s > s. , there must be an a 

P “ k jP 

a.. ** 0. ( j 5^1 ) o Further, for each critical one in column k, there 

is a distinct one in column p with a corresponding zero in column k. 

We perform interchanges on such critical ones, the typical interchange 

resulting in a. » 1; a,, = 0; a. = Oj and a = 1. We may now 
a ip ik JP jk 

replace column k by column p in E-* and the new columns of E* form a 
minimal a-set of representatives* 

Clearly this construction is possible for each column to the left 
of column e, which is not in E*. Hence the construction yields 

e %(R,S). QED 

£.1 A Supply-Demand Network . 

In this section we shall consider directed networks. Let N 
represent the set of nodes of a network and (Q represent the set of arcs. 
We denote an arc between x and y, members of N| by the ordered pair, 
(x,y) and assert that the notation implies the arc is directed from x 
to y, and is not the same arc as the one denoted (y,x) . We associate 
with each arc in ($, a non-negative function c(x,y) called a capacity 
function, and a non-negative function f(x,y) called a flow function. 

We associate with some nodes in N a non-negative function a(x) which 
may be thought of as a supply of some commodity available at node x, 
and we associate with some other nodes in N, a non-negative function 
b(y) which may be thought of as a demand for some commodity try node y. 

We make use of the following shorthand notation, Let S, T , be 
subsets of N, and let x,y be elements of N. Then by c(S,x) we mean 
seS and similarly for f(S,x). Also, try c(S,T) we mean 

seS teT c ( s >k)j and similarly for f(S,T). Analagous shorthand will 
be used for the functions, a and b. 
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Now let us assume we have a class of matrices, ^/(R^S), We 

devise a network for this class as follows s Let there be n nodes 

,th 



denoted b 



b^i with demand function, b(bj) *= s^, the j 



component of So Let there be m nodes denoted a^, . a^| with supply 
function a(a.) « r^, the i component of R. Let B « b^| A » a^. 

Let (a^bj) e (& for all i, jo Let c(a.,bj) « 1 for all i,j. This 
network has an arc capacity of one for each of the m*n elements of a 
matrix, A e ^((R,S), The commodity available at the nodes of A, and 
required by the nodes of B is, of course, ,, l ,,, s to distribute among 
these m»n elements of the associated class of matrices. We construct 
a flow in the network satisfying the following constraints? 



(6) 


f(x,N) - f(N,x)^ a(x) 


X G 


(7) 


f(N,x) = f(x,N) = b(x) 


X E 


(8) 


0 ^ f(x,y) = c(x,y) 


(x,y) 



Clearly this construction is possible. For R » 2, 2, 2, 2) and 
S *(3j3>2) such a network has been constructed in Figure 17. The 
number by each arc is the value of the flow function for that arc. 

Now let us construct the corresponding matrix (Figure 18) , If 
f(a^,bj) 1, let a 3 -j s 1$ if f(aj,bj) « 0, let ajj * 0. This matrix, 
A, is, in the class, ^l/(R,S). Furthermore, each unique feasible flow 
corresponds to a unique matrix, A e ^l/(R,S), and conversely. 

Then let us ask this question? under what conditions can we 
construct a flow so that f(a^,T)^a for each i, and for T « {b ? , ,,«, 
b^ ) ? This flow would correspond to distributing at least ot ones 
from each a^ to the nodes corresponding to the first e columns of a 
matrix in ^/(R,S). If vre can locate the smallest e for which this 
flow is feasible, we shall have found £. See Theorem 5»1« 
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a(a L ) * 2 



b(b 1 ) - 3 




Figure 17 



110 2 
110 2 
10 12 
0 112 

3 3 2 

Figure 18 
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Figure 19 
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Figure 20 



38 



Since 



pi ^ — ) n 

r. = ) s,; for the demands to be satisfied at 

i-i 1 ^j-i J 

each bj, the supply at the nodes of A must be totally exhausted for a 

feasible flow. Consider the network of Figure 19* with four sink 

nodes (those with positive demand functions), and three source nodes 

(those with positive supply functions). Let us construct for each 

e 5r n: a network as follows? Let B be the first E nodes of B. Let 

e 

(x,y)e d for x e A and y E B . Now construct m new nodes 

(a!, .... a') and let the set of these nodes be called A'. Let 

(a., a*. )e (H for each i ^m. Let (x,y)e (Q, for x e A' and y e B^B 
l i e 

(the relative complement of B^with respect to B) . Let the following 
be true? 




a(a. ) « r. 

b(b .) « s . 

J J 

c(a.,b ) = c(a! ?b^) 



c(a. ,a' ) = r . - a 
i i i 



m 



2 . y • • • y FI 



m 



j = • • • > £ 

k =f+l , . . . , n 






m 



The construction corresponding to the network of Figure 19 for 
e = 2 is shown in Figure 20 on the preceding page. Numbers above 
each arc are the capacities of the arc. What we have done is this? 
since the capacity of each (a.,aj) is a units less than the supply at 
a^, at least a units of supply must be distributed to the nodes of B £ . 
We call a flow feasible if and only if constraints (6), (7), and (8)1 
are satisfied and if 

(9) f(x,N) - f(N,x) * 0 x e A* 

is also satisfied. Clearly, the smallest e for which a feasible flow 
exists in this type of network is e(a). 
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Theorem £.2. 



The 


constraints 




(6) 


f(x,N) - f(N,x) 5 a(x) 


x e A 


(7) 


f(N,x) - f (x,N) % b(x) 


X E B 


(8) 


0 = f(x,y) = c(x,y) 


(x,y)e GL 


(9) 


f (x,N) - f (N,x) « 0 


x e A' 



where a(x) b(x) ^ Oj are feasible if and only if, 

(10) b(B flX) - a(A 0 X) % c(X,X) 



holds for every partition of N into subsets X and X (= N^X). 

This is the well known supply -demand theorem due to Gale. A 
proof may be found in l 2 ] . 

We apply (1C) to our network (in general) and observe that for 
partitions of the forms 

X K (a, , a , a!, ©••, a ^ s b , , . ® • , b * b^*.,, b } and 

1 ml e 1 e 1+1 7 n 

X * {a e+T a i!i ,c he+l’ ** 0, b f } 

where e and f are integer parameters satisfying 

0 m; f ^ n 

(10) is of the forms 

(11) S £+1 + ... + s f < ( r e+1 - a) + ... + (r^ -a) + e*(f - e) 

The validity of the inequality is obvious except possibly for the 
term, e*(f - e). This term is merely the number of a’ in X, times the 
number of b j in Xj and is the total capacity of all arcs connecting 
these two sets of nodes. 

Theorem £.3 . 

The constraints, (6), (7), (8), and (9) of Theorem £.2 are feasible 
for a network of the type of Figure 20, and for fixed e, if and only if 

(11) holds for all permissible values of e and f. 
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The proof of this theorem consists of looking at all subsets of 
nodes not of the form on the preceding page, and verifying that Theorem 
5.2 is valid for these subsets if it is valid for the subsets of the 
above form. Since the proof is not very interesting, and rather 
lengthy, it is omitted. 

Theorem 5°3 assures us that we need not test all subsets of nodes 
with (10) in order to assure ourselves that we have a feasible flow. 

Now let us multiply (11) by minus one| and rearrange terms? We arrive 
at ? 

(12) r e+1 + ... + r m - (s £ + 1 + ... + s f )+ e*(f-e) = a-(m-e) 

We thus have the condition that ^ is the smallest integer for which 
(12) is satisfied for all integer values of e and f in the ranges? 

0 ~ e m e Ir f If n 

Now the left side of (12) is the class invariant which Fulkerson 
and Ryser call N(e,e,f). [liK For ease of computation we define a 
function, Q(e,e,f) s N(e,e,f) - a * (m - e) . 'e' is the smallest e for 
which 

(13) Q(e ,e,f ) % 0 Of e! mj e % n 

We take first differences with respect to e, e and f$ and derive the 
following recursion formulas? 

(Ill) Q(e+l,e,f) - Q(e,e,f) + s E+1 ~ e 

(15) Q( c , e+l,f) - Q(e,e,f) + f + a - e - r g+1 

(16) Q(£,e,f+1) * Q(e, e ,f) + e » s f+1 

Since we know that P is a lower bound on the a-width of ary 

A e ^j[(R,S) we may take e ~ p, and compute an m+1 by n~e array 

making liberal use of (15) and 0.6) • If one of the numbers is negative, 
we increment e by one using ( lli) , and compute the m+1 by n~e array 
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for this new value of e. When an array is found which contains no 
negative members, we have found e. 

Now, as we mentioned before, we do not advise this procedure for 
hand computation, and it cannot be used for a greater than one, but 
the case a ■ 1 is the most important case by far, as will be seen in 
section seven, and a digital computer is admirably suited to perform 
these simple arithmetic computations. In the next section we shall 
discuss the computer program in which the above formula was usedj 
and the results obtained with a large number of matrices. 

6. The Algorithm Program . 

We present a procedural flow chart for the branch and bound algorithm 
in Figure 21. As much as possible of the procedure is described in 
abbreviated, but intuitive language. Where variable names are neces- 
sary they are either the names given to the same variables used in 
Sections 3*1 and 3*2, or they are defined on the flow chart itself near 
the point at which they are first used. Variable names are used that are 
in reasonable agreement with the corresponding names used in the computer 
program. 

The algorithm was programmed for the Control Data 160U computer 
using FORTRAN 63 source language, and CODAP 1 assembly language. The 
assembled program is included in this paper as Appendix I. There are 
several features of the program which deserve discussion. 

Since we are dealing with matrices composed of zeros and ones, 
storage space can be conserved by letting a single bit represent an 
element of the matrix. Then we may use logical operations to manipulate 
the matrix. Normally this would require writing the entire program at 
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the assembly language level; but we avoid this by taking advantage of 
a capability of FORTRAN 63 which permits the programmer to define his 
own type of arithmetic. 

The CDC l6oU word size is 1x8 bits. We chose to write the program 
to accept matrices up to dimensions llilx by llilj. We store a single row 
of the matrix in three consecutive computer words; hence an entire 
matrix requires only h32 words of storage. The first word of row i 
contains elements a.^ through the second word contains a , ^ 

through and the third word contains a.^y through a.^^. 

We define, according to the rules of FORTRAN 63 , a TYPE LOGICS 
arithmetic in which an elemental word consists of three consecutive 
words of memory. Vie call such an elemental word a TYPE LOGIC!? word. 

Thus, one LOGIC!? word is equivalent to one entire row of a matrix, or 
other variable which needs to be three computer words in length. For 
instance, we shall require several masks with which to derive the various 
sub-matrices, and each such mask must consist of three computer words. 

We shall need to take logical sums and products, to complement 

words, to clear words of ones in certain bit positions; and we shall 

require a method of generating a 1 in any of the Ibb bit positions of 

a LCGIC!? word. We define, through the subroutine, Q1QMATH, the symbol 

n * n to mean logical sum; the symbol to mean logical product; the 

symbol "<=■" with two arguments to mean "set the i t ^ bit of the first 

f h 

argument to zero if the 1 bit of the second argument is one*’ ; and 
the symbol with one argument to mean "complement the argument". 

We also define "ARGUMENT /j" to mean "set the j^ 11 bit of the argument 
to one, and all other bits to zero, counting from the leftmost bit 
position of the argument". 
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The only requirement we have for generating the sub-matrices of 
the given matrix is that we must be able to compute the corresponding 
row sums and column sums for use in estimating e. We may compute the 
row and column sums in the computer program without deriving each of 
the sub-matrices through the use of suitable masks. We require two 
such masks; one is a mask of columns upon which the program has already 
branched; and the other is a mask of columns chosen for inclusion 
in the minimal a-set at the current branch. In each case, a 1 in the 
i^ bit position of a mask indicates that column i is a member of the 
set of columns which the mask represents. 

Almost all arguments used in the various subroutines are stored 
in COMMON. This decreases the computation time at the expense of re- 
quiring difficult to follow indexing of the parameters. Most such 
parameters are stored in an array, JDATA. This array is really three 
consecutive arrays of parameters associated respectively with the 
matrices A^, &2K’ and ^2K+1 figure 2l. The correspondence between 
IDATA and the mnemonic variable names may be found in the EQUIVALENCE 
statement near the beginning of the program. 

The masks and bound of all matrices must be retained in storage, 
but other parameters, (row sums, column sums, dimensions, etc.) are 
recomputed each time they are required. If a random access storage 
device (such as a magnetic disc) is available a savings of computation 
time would result from the storage of these parameters. 

Up to 2000 sets of parameters can be retained in core storage 
simultaneously. When this limit is reached, the section of the program 
from statements 192 to 193 searches for any sets of parameters no longer 
required, discards them and compresses the remaining parameters into the 
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front of the storage area. This effectively increases storage space 
up to the point at which there are 2000 current branches of the algorithm. 
(Current branches are those branches for which the corresponding esti- 
mate of e is less than n inf inity" . ) 

Sample output is shown in Figure 22. This matrix can be recognized 
as the incidence matrix of the communications network discussed in 
Section one. Note that the matrix is printed in octal format which 
must be converted by hand to the proper (0,1) form. Each digit of the 
output represents three elements of the matrix; for example, the digit 
M 5" represents the three elements "1, 0, 1” . 

Short, but descriptive comments separate major sections of the 
program listing by tasks, and introduce each of the subroutines. The 
various CDC l6oh instruction manuals and programming manuals may be 
consulted for further information. 

The program is not very efficient in its present forms marry pro- 
gramming conveniences such as the use of TYPE LOGICS arithmetic, and 
the use of subroutines, makes writing of the program simpler at the 
expense of generating many otherwise unnecessary instructions. As a 
first step toward improving the efficiency, the author recommends 
elimination of TYPE LOGIC!? arithmetic, substituting in its place, 

C0DAP1 subroutines to perform the necessary substitute operations, and 
using direct calls to these subroutines in place of the operations 
symbols. In addition, it is recommended that all present subroutines 
written in FORTRAN 63 be incorporated into the main program. Program 
space is not critical in a computer the size of the CDC l60lr, and by 
writing the subroutines as part of the main program, advantage may be 
taken of task specialization. For instance, subroutine R0WSUM computes 



the sum of all M rows of the matrix each time it is called. It takes 
as much time to compute a row sum which is zero as one which is not? 
but we have information which could be used to specialize the routine 
so that it skips over rows whose sum is zero. 

A still better technique would be to write the entire program 
at the assembly language level? especially if the user intends to 
use the program for more than the solution of a few matrices, 

6.1 Results of Using PROGRAM WIDTH . 

If a matrix has a-width^ c 9 and we were to attempt to find the 
a-width by looking at all possible sets of a columns,, then all possible 
sets of o + 1 columns*, and so forth up to all possible sets of e - 1 
columns and finally some sets of e columns s we should have to look at 
X sets of columns for 




We should have to look at this number of sets of columns using the 
branch and bound algorithm also,, if all of the estimates of e which were 
current turned out to be equal. It is conceivable that this could 
happen for some problem? hence we must take (17) as an upper bound 
on the number of branches which must be investigated by the program. 

Now the branch and bound algorithm is not the most efficient way to 
search subsets of columns 5 so we are quite interested in determining 
just how far below the upper bound we can stay by branching and bounding. 
Since we cannot express ary theory to demonstrate the efficiency 
of the algorithm,, the only choice open to us was to solve many problems 
of varying sizes in hopes that trends could be established. It is for 
this reason that subroutines RANDOM and RANDGEN were added to the pro- 
gram. These two subroutines generate matrices of ary size up to lltk 



by lltU • A uniform random number generator is used to generate three 
consecutive random numbers which represent one row of a matrix of lU; 
columns. If a matrix of N columns is desired (N < lU*) bit positions 
N + 1, . .., II 4 .I 4 . of the three word element are set to zero. The number 
of ones remaining in the three words is computed and compared to a user 
supplied argument, NONES. If NONES is less than the remaining ones in 
the three word element, another set of three words is generated, the 
appropriate bit positions cleared to zero, and then the logical product 
of the two elements is taken. This procedure is repeated until MDNES 
is greater than or equal to the number of ones remaining in the three 
word element. Thus MDNES represents the maximum permissible row sum 
of any row in the matrix. The three word element is then assigned as 
a row of the matrix, and the procedure is repeated until an entire 
matrix has been generated. We are thus reasonably sure of a random 
distribution of ones throughout the matrix, and we have some control 
over the density of ones in the matrix. Matrices of any dimension are 
generated in no more than a few seconds. 

Our original plan was to generate and solve five matrices of each 
of 112 sets of dimensions for the matrix. It was felt that such a set 
of matrices would be a statistically significant sample from which 
computation time could be functionally related to such parameters as 
matrix dimensions. Unfortunately, time has prevented the completion 
of this scheme. Hence all remarks that follow in this section are with- 
out statistical significance. 

We have been able to generate and solve over 200 matrices of vary- 
ing dimensions for their 1-width; one being by far the most important 
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value for <*. Dimensions of matrices generated were from the 8 by 9 
problem of section one to matrices of dimensions lUU try 25, and 35 by 
100. Some relatively square matrices of size 50 by U5 are included. 

As is to be expected, computation time varies directly with number of 
branches considered when matrix dimensions are held constant. Let us 
therefore make some remarks about the number of branches considered 
by the program in solving these matrices. 

It is clear that the number of branches is a function of the 
number of columns and of the actual 1-width of the matrix. Not quite 
so obvious is that the number is a function of the number of rows in 
the matrix. However the fluctuations apparent in the number of branches 
is very wide. For instance, for one matrix 260 branches were taken 
while for another, 1536 were taken. Both matrices were of dimensions 
35 by 35, and had a 1-width of seven. It is apparent that other 
factors must be involved. One such factor is the distribution of ones 
in the matrix. One matrix, a Steiner triple system [5], which is a 
matrix which among other properties has all row sums equal and all 
column sums equal; required investigation of 1216 branches before com- 
puting the 1-width as nine. Yet this matrix had only 35 rows and 
15 columns. The symmetry of the matrix made it difficult to weed out 
unprofitable branches. Another matrix of dimensions 50 fcy b5 exceeded 
the capacity of the program storage after 2 l8Lt branches. In every 
case, however, the number of branches were below the upper limit given 
by ( 17 ) - Values seldom exceeded 600 for ary of the matrices. 

Of more practical interest is the time required for computation. 

The CDC l60li has an effective cycle time of b.8 m sec. The longest 
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time required to solve any problem was 36 minutes,, although there were 
problems which had not been solved when the program was stopped by the 
operator after about li5 minutes. The matrix which required 36 minutes 
was of dimensions 50 by and had a 1-width of six. The relationship 
between branches and computation time is rather interesting. Matrices 
of dimensions 125 t ry 25 required a little over one second per branch 
whereas matrices of dimensions 25 by 120 required between two and three 
seconds per branch. This seems to verify that advantage could be gained 
by eliminating the L0GIC5 arithmetic in favor of more efficient methods, 
since the amount of L0GIC5 arithmetic required increases with number of 
columns. 

Computation time was graphed on semi-log paper versus 1) number 
of columns, 2) number of rows, and 3) 1-width of the matrix. Figure 23 
is a graph of time versus number of columns for matrices of 25 and 35 
rows. Figure 2h is of time versus number of rows for matrices of 25 
and 35 columns, and Figure 25 is of time versus 1-width for matrices 
of dimensions 20 by 20. 

From these graphs, it seems reasonable to conclude an exponential 
increase in computation time versus both number of rows and number of 
columns. No hypothesis is made about the parameters of the function. 

Our method of generating matrices degrades the validity of Figure 25. 

In order to create matrices of high 1-width, we can only lower the den- 
sity of ones in the matrix. This, in turn, increases the likelihood 
of rows of sum one? which results in an artificial simplification of the 
problem. This is apparent especially in the case of the matrices of 
1-widths nine and ten in Figure 25. 



58 



Minutes 




Figure 23 



CO'OS 8 'g £ VS S 2 88 g 



Minutes 




Figure 24 



0^1.1 mi») 




eu) 



Figure 25 



The validity and usefulness of the algorithm has been established 
try the above results. Most computation times were under 20 minutes, 
and it is felt that computation times could be reduced more than 2 $% 
by cleaning up the program and doing without the programming convenience 
of TYPE LOGIC!? arithmetic. 

7. Applications and Extensions. 

In this final section we consider applications of the branch and 
bound algorithm to solution of real-world problems, and certain famous 
problems of the mathematical puzzle category. We shall also propose 
certain extensions of the algorithm as presented in section three, which 
enlarge the class of problems which may be solved. Certain of the 
problems may, indeed, be more easily solved by other methods, but they 
are presented here to illustrate the variety of problems which may be 
formulated in terms of finding the C -width of a (0,1) matrix. 

7.1 The Eight Queens Problem . 

A famous mathematical puzzle is the followings place the maximum 
number of queens on a chess board so that no two may attack each other. 

We construct a graph of 6lt nodes, one for each square on the chessboard. 
Connect two nodes if a queen may move from one node to the other. The 
minimal 1-set of the node-arc incidence matrix is a minimal set of 
nodes that touch all arcs of the graph. Now since this matrix has row 
sums which are all equal to two, the sub-matrix consisting of all columns 
not in the minimal 1-set has row sums of at most one. Hence in the 
graph corresponding to this sub-matrix, there is no connection between 
any of the nodes. Thus the complement cf the minimal 1-set of nodes 
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represents square at which the maximum number of queens may be placed 
so that no two may attack each other. This problem is a special case 
of a class of problems which also includes the targeting problem of 
Section one. We next present a description of this general class of 
problem. 

7.2 The Connecting Nodes Problem . 

Find the fewest number of nodes that touch all arcs in a graph. 

Here the rows of the incidence matrix are arcs of the graph, and the 
columns are nodes. The 1 -width of the incidence matrix is the solution 
to the problem. In addition to the targeting problem of Section one, 
another problem of this type is the followings 

Given a communications system of some type (let us say a system 
of highways connecting towns), what is the minimum number of arcs (high- 
ways) which must be kept safe from attack (natural disaster, etc.) so 
that no node is isolated? In order to solve this problem, we construct 
an incidence matrix as followss list the arcs as columns and the nodes 
as rows. Let a-j « 1 if the i^ node is a terminal of the j^" 1 arc. 

Then the 1-width of this matrix is the solution to the problem. 

7.3 Simplification of Logical Functions . 

We present this problem to show the range of interpretations which 
may be made from the concept of a -width. The reader is cautioned, 
however, that the incidence matrix required for using the branch and 
bound algorithm to solve this problem is likely to become prohibitively 
large. 
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Given a truth table for a proposition letter formula, F, in r 



proposition letters, p^, . p r j find a disjunctive normal form for 
F which has the fewest number of terms. If we let represent the 
conjunction operator; "+ ,f represent the disjunction operator; and a 
be the negation of a, then the disjunctive normal form for a proposi- 
tion letter formula is of the forms 

(pj_ & P 2 ) + (P x & P3 & P[P + (P 3 & P^) + .0. 
where the p^ are proposition letters. The expressions enclosed within 
parentheses are called terms. The problem is of interest in switching 
circuits and in the logical design of digital computers. 

For columns of the incidence matrix take all terms having one 

of the forms s q.s q. & q : ; ...; q. & ... & q ? where q is either p„, 

or its negation; and such that the term takes the value "true" only 

if F does also for all values of the p not explicitly present in the 

term. For example, if p^ & p^ is a term of F in three proposition 

letters, p^ p ? , p^j then both Ffp^p^p^) and F^, p^, p ) must be 

true if p & p„ is true. We next construct a row of the incidence 
1 3 

matrix for each ’'true'’ entry of F in the truth table. Place a one in 
the column corresponding to the assignment of values to p^, ..., p 
which makes up the entry in the truth table corresponding to the "true" 
entry of F. Then place ones in all other columns vrhich are also true 
for this assignment of values to the p£. Thus if p^ & p^ & p^ makes 
F true, a row of the matrix would have a one under this column label 
as well as under p^ & p^; p^ & p and so forth. 

As an example, consider the truth table of Figure 26 . The columns 
of the matrix would be labelled '*p^ & P2 ,! ; H p^ & py; “P2 & P3" ;"P]&p2&P3" 
"p^ & p£ & py; "p^ & P2 & py | *'p ^ & P2 & py . The four rows would 
have ones in columns labelled as follows? 
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Figure 26 

Row 1: p 2 & p 5 p t & p 2 & p 3 

Row 2s p l & p 2 ; p x & py p i & p^ & p^ 

Row 3; p x & *p 2 ; P x & V 2 & P 3 

Row ht p x & p $ P 2 & P 3 ; pj_ & P 2 & P 3 

Clearly, p 2 & p 3 and p^ & p 2 are a minimal 
for the matrix, and F = (p^ & p 2 ) + (p^ & p ) is 
normal form. 



1-set of representatives 
a minimal disjunctive 



7.1; The Minimal C-cover Problem. 

It would be quite simple to extend the computer program to solve 
the minimal C-width problem. Essentially all that would be necessary 
is input revision to accept the vector, C, and the initial setting of 
the vector, RT to C. Of course, e could not be calculated, but would 
have to be estimated using exactly the same subroutine which is presently 
used for the situation, a > l. For an entirely different algorithm for 
solving the minimal C-cover problem (and hence also the a-width problem) 
see [8]. The minimal C-cover extension is of interest primarily as a 
first step to a more involved and more useful extension. 
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7.5 A Minimal. Cost C-eover . 

One of the more obvious deficiencies of the solution to the 
targeting problem of section one is that when only one 1-set. is computed* 
that particular set might include a target very heavily fortified 
whereas one not as heavily fortified might have been a member of another 
minimal u-set. One approach to remedy- this deficiency would be to 
compute all minimal ct-setsj and indeed the approach will be mentioned 
subsequently. However* it is also possible that for some variety of 
reasons* it would be preferable even to destroy more than the minimum 
number of targets. The term* preferable* indicates that there might 
be a utility function or cost function associated with the problem. 

The extension cf the algorithm so that it may handle costs 
associated with the columns is perhaps the most interesting extension 
that we shall discuss. The author believes that this extension might 
result in a decrease in the computation time required. The belief is 
based upon the observation that the lower bounds calculated in the pre- 
sent program are relatively close to each other. Thus there is entirely 
too much switching away from one branch* to another* and then back to 
the original branch. With a wide difference among the column costs* 
however* the differences among the various estimates of the C-width 
of the original matrix should be equally wide. This will serve to 
reduce the unnecessary switching from branch to branch. That is* it 
is more likely that when a branch is dropped by the algorithm* it is 
because that branch has become unprofitable. 

Suppose we assign to each column of the matrix a . cost* which need 
not be integral* and may be positive* negative* or zero. Vie would 
then be interested in finding a C-cover (or an a -set) which has 
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minimum cost associated with it. Of course* such a C-cover might 
not be a minimal Ocover as defined previously. 

The modification to the computer program would be surprisingly 
simple. The cost vector would be read in, and let us assume stored in 
COMMON. Now the "infinity** for unfeasible column combinations must be 
increased to some arbitrarily large number. An estimate of e(c) would 
be calculated for each sub-matrix using the same subroutine as present- 
ly used for a > 1. From subroutine BOUND, however, the program 
would enter a new subroutine, such as subroutine COST presented in 
Figure 27. In this subroutine, a cost for the p just computed would 
be estimated. The estimate would be optomistic in the sense that the 
cost for the columns would be the sum of the smallest cost components 
not already used on this branch. For example, consider a cost vector, 
(1,2,3)* Assume that, on the current branch, column one has been 
either included or excluded, and that we have computed p * 1. Then 
the cost for the sub-matrix would be estimated as twoj and the esti- 
mate of the cost for the minimal C-cover would be two plus the cost of 
column one, if column one had been included, or two, if column one 
had been excluded. 

Finally, either in the same subroutine, or in the main program, the 
cost of the set of currently included columns would be computed and 
stored in place of the argument, VCOL(I) of the current program. Also, 
VEPSILON(I) of the current program would be replaced by the sum of 
VCOL(l) and the cost estimate just computed in subroutine COST, as 
described in the above paragraph. 

The reader is reminded that the subroutine COST of Figure 2? 
has not been checked out and that the remarks in this section about 
decreasing total computation time represent merely the author* $ 
intuition and are not based upon observations. 
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SUBROUT INE COST (EPSILON, I) 

COMMON/feLOCKH /CCOST ( lbh) /3LOCKB/lDATA( 1?6 1) 

EQUIVALENCE (MASKC, IMASK(3) ) 

TYPE LOGICS (3) HASH C, BIT 
DIMENSIONS TEMP( II 4 .I 4 .) 

DO 5 j - 1, 3 

5 IHASK(J) « inATA((587*(I“l))+582+J) 

N « IDATA((587*(I-l))+3) 

BIT *» BIT * MASKC 
IF (BIT.EQ.O) 10, 20 
10 K - K ❖ 1 

TEMP(K) « CCOST (3) 

IF (TEMP(K).LT.TEST) 15, 20 
15 TEST « TEMP(K) 

K1 - K 
L » EPSILON 
20 CONTINUE 

IF (K.LT.L) 25, 30 
25 EPSILON «* l.E+20 
RETURN 

30 EPSILON - TEST 
J » 1 

35 J s J + l 

TEST 1 « TEMP(l) 

DO US M « 2, K 

IF (TEMP(K) .GE.TEST -AND „TEMP(M) .LT .TEST 1.AND.K1.NE.M) 1*0, l£ 
liO TEST 1 « TEMP(M) 
li5 CONTINUE 

EPSILON - EPSILON 4- TEST 1 
TEST « TEST 1 
IF (J.EQ.L) 50, 35 
50 RETURN 
END 



Figure 2? 



Once both of the above extensions have been programmed we may use 
the algorithm to solve a large variety of problems which are a sub- 
class of the set of integer programming problems. 



7.6 An Integer Program with a (0,1) Constraint Matrix . 

We merely point out in this section, a formulation of a problem 
which the extended algorithm can solve. Given the system of linear 
inequalities: 



I 



>1 



a. •• x . 2b. 
1 J J i 



1, 2 , » • . , m 



where a., is either zero or one: find values for each x. such that x. 

J J 

is either zero or one, which minimizes: 

,n 



I 



>1 



C j'Xj* 



Here the Cj are costs, and the b. are the components of what has pre- 
viously been called the C vector. 



7.7 Constraints on Combinations of Columns . 

Suppose that upon any of the problems which may be solved by 
extensions of the algorithm, we impose constraints of the following 
types if column a is included in the C-cover, then column b must be 
exc luded. 

We could write a relatively short subroutine to handle this type 
of constraint. It would be necessary to put the constraints in a con- 
venient form, say for each constraint construct a mask of zeros except 
in the bit positions corresponding to columns which cannot be included 
together. For example, let there be five columns and assume two 
constraints: that columns one and two cannot be included together, and 

that columns three and five cannot be included together .Then the two 
masks would be: 
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11000 and 00101. 

We put these constraints into the program in some convenient fashion 
(probably by the same system used for putting the A matrix in the 
current program) ; and write a subroutine to compute the logical pro™ 
duct of each constraint with the mask of included columns. If there 
are no ones in the product for ary constraint, the subroutine must set 
a ’'current 1 ' cost vector equal to the input cost vector. If there are 
two or more ones in ary single product, the subroutine must indicate 
that an infeasible column selection has been made. Finally, for each 
constraint with exactly one "l” in the product, set the ’'current" 
cost of every "1" in the constraint to "infinity"; except of course, 
the "1" representing the current column inclusion. We use the 
"current" cost vector in computing bounds instead of the input cost 
vector. 

7.8 Finding All. Minimal C-covers of the Matrix . 

It is possible to simplify the search for minimal Clovers once 
the first one has been located. The same algorithm applies except that 
we have information to rule out as infeasible, ary set of columns 
which yields an estimate of e(C) larger than the computed C-width. 
Although the simplification would contribute to a substantial savings 
in computation time for each additional minimal C-eover, it is believed 
that for most problems the search for all minimal C -covers would re- 
quire more computation time than the results would warrant. 
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APPENDIX I 



Listing of PROGRAM WIDTH 
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